Weakly supervised parsing with rules
Abstract
This work proposes a new research direction to address the lack of structure in traditional n-gram models. It is based on a weakly supervised dependency parser that can model speech syntax without relying on any annotated training corpus. Labeled data is replaced by a few hand-crafted rules that encode basic syntactic knowledge. Bayesian inference then samples the rules, disambiguating and combining them to create complex tree structures that maximize a discriminative model’s posterior on a target unlabeled corpus. This posterior encodes sparse selectional preferences between a head word and its dependents. The model is first evaluated on English and Czech newspaper texts and then validated on French broadcast news transcriptions.
Similar resources
Semantic Graph Construction for Weakly-Supervised Image Parsing
We investigate weakly-supervised image parsing, i.e., assigning class labels to image regions by using image-level labels only. Existing studies focus mainly on the formulation of the weakly-supervised learning problem, i.e., how to propagate class labels from images to regions given an affinity graph of regions. Notably, however, the affinity graph of regions, which is generally construct...
Weakly Supervised Matrix Factorization for Noisily Tagged Image Parsing
In this paper, we propose a Weakly Supervised Matrix Factorization (WSMF) approach to the problem of image parsing with noisy tags, i.e., segmenting noisily tagged images and then classifying the regions only with image-level labels. Instead of requiring clean but expensive pixel-level labels as strong supervision in the traditional image parsing methods, we take noisy image-level labels as wea...
Parsing with PCFGs
The PCFG model is without doubt the most important formal model in syntactic parsing today, not only because it is widely used in itself but also because many later developments start from it. In this lecture, I will first introduce the basic formalism (§1) and the parsing model that naturally follows from it (§2). I will then give an overview of standard techniques for parsing (§3), for superv...
Weakly supervised training for parsing Mandarin broadcast transcripts
We present a systematic investigation of applying weakly supervised co-training approaches to improve parsing performance on Mandarin broadcast news (BN) and broadcast conversation (BC) transcripts, by iteratively retraining two competitive Chinese parsers from a small set of treebanked data and a large set of unlabeled data. We compare co-training to self-training, and our results sho...
A Weakly-Supervised Rule-Based Approach for Relation Extraction
Rule-based approaches for information extraction usually achieve good precision values, even if they often need a lot of manual effort to be implemented. In this paper, we present a novel rule-based strategy for semantic relation extraction that takes advantage of partial syntactic parsing in order to simplify the linguistic structures containing instances of semantic relations. We also...
Publication date: 2013